A Robust Feature Extraction for End Point Detection in the Nonstationary Noisy Environment

نویسندگان

Sangjun Park

Jungpyo Hong

Minsoo Hahn

چکیده

This paper proposes a robust feature extraction for voice activity detection (VAD) in noisy environments where nonstationary noises exist. The accuracy of the VAD is drastically reduced because the fluctuation of features at the noise intervals causes false alarm rate to being increased. In this paper, in order to improve the VAD accuracy, harmonic-weighted energy is proposed. This feature extraction method focuses on voiced speech interval and weights the amount of the ‘harmonicity’ to the averaged energy of the frame input. To evaluate the performance of the proposed feature extraction method, receiver operating characteristic curves and equal error rate are measured. From the results, it is proved that the proposed method is the discriminative feature for VAD. Keywords— voice activity detection, robust feature extraction, harmonic to noise ratio

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition in Noisy Environment Using Different Feature Extraction Techniques

In this paper, different feature extraction methods for speech recognition system such as Melfrequency cepstral coefficients (MFCC), linear predictive coefficient cepstrum (LPCC) and Bark frequency cepstral coefficients (BFCC) are implemented and the comparison is done based on average recognition accuracy. We suggest a noise robust isolated word speech recognition system which can be applied i...

متن کامل

Accurate Fault Classification of Transmission Line Using Wavelet Transform and Probabilistic Neural Network

Fault classification in distance protection of transmission lines, with considering the wide variation in the fault operating conditions, has been very challenging task. This paper presents a probabilistic neural network (PNN) and new feature selection technique for fault classification in transmission lines. Initially, wavelet transform is used for feature extraction from half cycle of post-fa...

متن کامل

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

Acoustic and Data-driven Features for Robust Speech Activity Detection

In this paper we evaluate different features for speech activity detection (SAD). Several signal processing techniques are used to derive acoustic features that capture attributes of speech useful in differentiating speech segments in noise. The acoustic features include short-term spectral features, long-term modulation features both derived using Frequency Domain Linear Prediction (FDLP), and...

متن کامل

Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech

In this paper a variety of front-end configurations are evaluated on Hungarian telephone speech databases. Our aim was to measure directly the efficiency of the front-ends on real noisy and normal speech data. As a baseline the ETSI ADSR standard front-end is used. Some simplification on the standard is introduced resulting in better performance on our databases than the original front-end in t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2014

A Robust Feature Extraction for End Point Detection in the Nonstationary Noisy Environment

نویسندگان

چکیده

منابع مشابه

Speech Recognition in Noisy Environment Using Different Feature Extraction Techniques

Accurate Fault Classification of Transmission Line Using Wavelet Transform and Probabilistic Neural Network

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Acoustic and Data-driven Features for Robust Speech Activity Detection

Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech

عنوان ژورنال:

اشتراک گذاری